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hlltitu una.fej>fl conventional prime factor discrete Fourier transform (DFT) algorithm 
is used to realize a discrete Fourier-like transform on the finite field , GF(q n ). A pipeline 
structure is used to implement this prime factor DFT over GF(q n ). This algorithm is 
developed to compute cyclic convolutions of complex numbers and to decode Reed - 
Solomon codes . Such a pipeline fast prime factor DFT algorithm over GF(q*T) is regular , 
simple , expandable , and naturally suitable for VLSI implementation. An example illus- 
trating the pipeline aspect of a 30-point transform over GF(q n ) is presented. 
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I. Introduction 

Let GF(q n ) be a finite field, where q is a prime number. A 
discrete Fourier-like transform over GF(q n ) is defined by 

d-l 

A k = ^ a n ot nk mod q,0 d - 1 (1) 

n= o 

where A k , a n eGF(q n ), a is the c/th root of unity in GF(q n ) 
and d\q n - 1. 

In some applications of finite field transforms to coding and 
digital filters, the transform length d is not a power of two. 
Thus, the usual FFT algorithm cannot be used to calculate 
transforms of as many as d points. 

In this article it will be shown that if d = d i • d 2 . . . d r> 
where (d t , dj ) = 1 for i ±j , then the transform in Eq. (1) and 


its inverse can be calculated by the fast prime factor DFT 
algorithm (Refs. 1-3). The prime factor DFT algorithm is 
based on the use of the Chinese remainder theorem (Ref. 4). 
In this algorithm, the one-dimensional d -point Fourier trans- 
form over GF(q n ) is converted into an r -dimensional, d t - 
point, (for /- 1,2, . . . ,r) Fourier transforms over GF(q n ). 
This algorithm is called a prime factor DFT algorithm over 
GF(q n ). 

A primary advantage of the prime factor DFT is that the 
VLSI processor for computing a d -point transform over 
GF(q n ) is composed only of d x + d 2 + • • ♦ + d r basic cells. 
Each basic cell performs a sum and product operation in 
GF(q n ). 

It is well known that the systolic array of a DFT computa- 
tion requires d basic cells (Ref. 5). Therefore, the number of 
basic cells of a prime factor DFT algorithm over GF(q n ) are 
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substantially fewer than the systolic array implementation of 
the DFT. Of course, this advantage must be weighed against 
the difficulty of the rearrangement of two input and output 
sequences of d samples in the VLSI implementation. 

In this article, a pipeline structure for rearranging input 
and output data is developed for a prime factor DFT over 
GF(q n ). As a consequence, a pipeline fast prime factor DFT 
over GF(q n ) can be developed. Finally, it is shown that a 
pipeline fast prime factor DFT over GF( 2 8 ) can be used to 
decode a (255,223) Reed-Solomon Code (Ref. 6) and that a 
pipeline fast prime factor DFT over GF(q 2 ), where q = 2? - 1 
is a Mersenne prime, can be used to compute the cyclic convo- 
lutions of two sequences of 8 • p symbols. 
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II. A Pipeline Fast Prime Factor DFT 
Over GF(cf) 

In this section a method for computing a d-point DFT 
over GF(q n ), based on the Chinese remainder theorem, is 
investigated. 

Let d = d x • d 2 . . . d r , where (d g , d f ) = 1 for i ;, and also 
let k and n in Eq. (1) be represented by the r-tuple (Jc l , k 2 , 
. . . , k r ) and (n x , n 2 , . . . , respectively, where kj = k 
mod dj and rij = n mod dj for 1 < / < r. By the use of the 
Chinese remainder theorem, it is shown (Ref. 4) that a d-point 
DFT over GF(q n ) in Eq. (1) can be decomposed into an r, 
d-point, DFTs (/ = 1 , 2, . . . , r). In other words, the computa- 
tion of A k given in Eq. (1) can be written explicitly in terms 
of the d^th root of unit in GF(q n ) as 
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and (dj , dj) = 1 (for / = 1 , 2 , . . . , r) is a dj th root of unity i 
GF(q n ) (Ref. 6). 
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Instead of computing Eq. (3) directly, an alternate pipe- 
line technique can be developed to compute Eq. (3). To see 
this, let d = d x • d 2 , . . . , d r = d J 1 ) • d \ x ^ , where d\ - d x • 
d 2 , . . . , d r _ x , and (g?/ 1 ), g? 2 < 2 >) = 1. Thus, by Eq. (3) 

a one-dimensional transform of d samples can be converted 
into a two point transforms, for / = 1,2. In a similar 
fashion, the one-dimensional transform for dp) can be con- 
verted into two dp)-point transforms, for i = 1,2, where 
d\ 2 ^ = * d 2 , . . . , d r _ 2 and dp) = d r-1 , and so forth . This 

algorithm is called a pipeline prime factor DFT algorithm on 
GF(q n ). It is shown in the next section that this new algo- 
rithm can be readily implemented in VLSI technology. 


III. VLSI Design for Computing a Pipeline 
Fast Prime Factor DFT Over GF(cf) 

A VLSI processor for computing Eq. (1), composed of 
d basic cells, is now developed, using the direct approach. 
Each basic cell performs a sum and product operation in 
GF(q n ). That is, a «- a + b * c, where denotes the opera- 
tion “is replaced by.” The VLSI architecture of the DFT over 
GF(q n ) using a method similar to the systolic array technique 
is illustrated in the following example. 


From Eq. (2), this algorithm consists of the following r 
stages: 


Example 1 . Architecture of a 6-point DFT using the direct 
systolic array technique. 
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A 6-point DFT over GF(q n ) is defined by 

4 - E 

«= o 

where a is a 6th root of unity in GF(q n ). A VLSI architecture 
structure for computing this transform over GF(q n ) is shown 
in Fig. 1. This architecture contains six basic cells. The princi- 
pal function of each basic cell is to perform the operation a «- 
a + b • c. In this example a nk , which is stored in the registers 
A k for 0 < k < 5 , is assumed to be precomputed. 

First, the complex symbol sequence a n for 0 < n < 5 is 
sent to all of the cells, simultaneously. Assume initially that 
all registers are set to zero. After the input data are entered, 
the components of A k in Eq. (1) are contained in registers 
B k for 0 < k < 5 . The values computed in this manner are 
shifted sequentially out of register fi 0 . 

The application of Eq. (3) to the VLSI design of a d-point 
DFT is illustrated in the following example. 

Example 2. Architecture of a 6-point DFT given in Eq. (1) 
using Eq. (3). 

Since d = d x • d 2 = 3 • 2, by Eq. (3) one obtains a 6-point 
DFT as 


In this figure, the input sequence a n , first, is shifted sequen- 
tially into the memory A, which is composed of six registers 
A 0 , A x , . . . , and A s . Then, each column of memory A is 
shifted right out of the last column into the unit of the 2 -point 
DFT. Then each column of A is cyclically shifted up by one 
symbol, except for the first column. 

Simultaneously, a 2-point transform of the first column is 
computed by the direct systolic array method. These opera- 
tions are continued repeatedly until the first column of mem- 
ory A is shifted out of the circuit. The components of 
A ( n i k ^ are obtained in memory B. 

Finally, two 3-point transforms of the first and second rows 
of memory B are calculated by Eq. (4b). The components of 
A(k x ,k 2 ) are obtained in memory C (see Fig. 2). 

The method used to rearrange the output data from>l(fcj t k 2 ) 
back to A k is illustrated for Example A.l in the appendix. 
Each column of memory C is shifted up and out of the first 
row and then into the last stage. After this, each row of C is 
cyclically shifted right by one symbol except for the third 
row. These operations are continued repeatedly until the last 
row of memory C is shifted out of the circuit. 

The first three samples, A 0 through A 2 , are obtained by 
switching them to the output line of memory C. The next 
three samples are obtained by switching them to the other 
output line with a three-stage delay element (see Fig. 2). 


A \n v k 2 ) 12 a (n y n 2 ) a 2 2 
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If a structure similar to that used in Example 2 is used 
(4a) recursively, the general pipeline fast prime factor DFT algo- 
rithm over GF(q n ) can be obtained. This is illustrated by 
the following example. 
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Example 3. Architecture of a d = d x • d 2 - 5 X 6-point 
DFT over GF(q n ) using Eq. (3) 


where a 2 = a 3 and a, =a 4 are the 2nd and 3rd roots of unity jo compute this 30-point DFT over GF(q n ), first, one 

in GF(q n ). computes the two-dimensional 5X6 DFT over GF(q n ) by 


To compute Eq. x (4a), first, one maps the input index n 
into a pair of indices n, and n 2 , where n, = n mod 3 and 
rtj = n mod 2. That is, a 0 -*■ a (00) , a, ->• a (1 -+ (Oj 0 ), 

a 3 ( fl o,i)> fl 4 "*■ a (i,o)- and a s a (2,i)- The method u sed t( > 
convert a n to d( ni ,n 2 ) is described in Example A.l in the 

appendix. 

The pipeline structure for computing the 6-point DFT 
given above in Example 2 is shown in Fig. 2. This pipeline 
architecture makes possible an efficient design with fewer 
cells and a small loss in throughput rate. 
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where a x = a 6 and a 2 = a 25 are the 5th and 6th roots of unity 
in GF(q n ), respectively. 

To compute Eq. (5a), one needs to convert a n into a^ ni 
where n x = n mod 5 and n^-n mod 6. The procedure for 
rearranging this input sequence is given in Example A .2 in 
the appendix. Then, the 6-point DFT over GF(q n ) in Eq. (5a) 
can be decomposed into a two-dimensional 3X2 transform. 
This 2-D transform can be implemented by Example 2. 

A 5-point DFT over GF(q n ) in Eq. (5b) can be computed 
by the direct systolic array method. Finally, the resulting data 
,^ 2 ) is arran S ed in th e reverse manner back to A k (see 
Example A.2 in the appendix). A pipeline architecture for 
implementing a prime factor 30-point DFT over GF(q n ) is 
shown in Fig. 3. 

In Fig. 3, one observes that one needs to permute the 
index n 2 of the last column of d 2 = 6 symbols in matrix A 
and the index k 2 of the first row of d 2 =6 symbols in matrix 
C, where n 2 = n mod 6 and k 2 = k mod 6. Thus, in order to 
reduce the number of permutations in the hardware, one 
should choose d 2 < d x , where d = d x • <£,. Then, one needs 
only to permute the index ^ of the sequence of d 2 symbols. 

In Example 3, if one chooses d = d x • d 2 = 6 • 5, then one 
does not need to permute the index n 2 of the last column of 
d 2 = 5 symbols in a 6 X 5 matrix. In general, if d x - d 2 = 1, 
then no permutations are needed for index . 

IV. Applications 

In this section it is shown that a pipeline fast pipeline 
prime DFT over GF{q n ) has application to future designs of 
coding and digital signal processors. 

A. Coding 

It is shown (Ref. 6) that a 255-point transform over GF( 2 8 ) 
can be used to decode a (255,233) Reed-Solomon Code. 
Since the transform length of this transform is cf = 255 = 1 7 X 
5 X 3 = 17X 15 = • d 2 , the transform can be computed 

by an algorithm similar to that used in Example 2. Thus, a 
pipeline fast prime factor DFT algorithm over GF{ 2 8 ) can be 
used to decode a (255,223) Reed-Solomon Code. 

The arithmetic needed to compute the 255-point transform 
over GF( 2 8 ) in RS decoder requires only 3 + 5+17 = 25 
basic cells. Each basic cell performs a sum and product opera- 
tion over GF( 2 8 ). 


B. Digital Signal Processor 

Recently the authors (Ref. 7) defined transforms over 
GF(q 2 ) of the form 

d - 1 

A k = 12 “n m0d « 
n = 0 

where q = 2? - 1 is a Mersenne prime, a is a dth root of unit 
in GF{q 2 ) and d\q 2 - 1. They showed that the convolution of 
two finite sequences of d samples can be obtained as the 
inverse transform of the product of their transforms. 

Nussbaumer in Ref. 8 showed that the complex integer 
(1 + /) is an 8 • pth root of unity in GF(q 2 ). Multiplications 
by powers of (1 + f) can be performed by simple bit rotations. 
As a consequence, the operations needed to compute an 
8 • p-point transform over GF(q 2 ) requires only cyclic shifts 
and additions. 

If one uses the direct systolic array method, the VLSI 
processor needed to compute the above 8 • p-point transform 
over GF{q 2 ) would require 8 • p basic cells. Each basic cell of 
this transform performs only a sum and cyclic shift in GF(q 2 ). 

Before developing further the pipeline fast prime factor 
DFT over GF(q 2 ), consider further some properties of the 
finite field, GF(q n ). 

Theorem 1 . Let GF{q n ) be a finite field. Also let d = 
d" 1 • , . . . , where (d (i dj) ^ 1 for / =£ / and d\q n - 1 . 

Then, a is an element of order d in GF{q n ) if and only if 
a d/di ^ j mod f or / = 1 5 2 , . . . , r. 

Proof. If a is an element of order d , then d is the smallest 
integer such that 

oc d = 1 mod q 
This implies a d ^ di 4 1 mod q. 

Assume a is not an element of order d. Then, the order of 
a is 0(a) =£ d y where 0(a) denotes an order of a. Thus, there 
exists h ¥= l such that 

d = d" 1 -d" 2 = h- 0(a) (6) 

By (Ref. 6), one observes that d t \h for some i. Therefore, 
did . _ 

a 1 = 1 mod q 
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Corollary 1. If a is an element of order of d in GF(q n ), 
then ai is also an element of order d for (/, d) = 1 . 

Proof. Since ( j\d)= 1, then d i Jrj\ where d- is a prime factor 
of d. Thus, 

.did. d/d. . 

(a 7 ) 1 = (a ) } = (a.) 7 mod q 

where a,. = is an element of order d t and a - d/d t . 

Assume 


Corollary 4. Let GF(q 2 ) be a Golois field, where q = 2 P - 1 
is a Mersenne prime. An 8th root of unity in GF(q 2 ) is in one 
of forms ±2 (P -1 )/ 2 (1 ± i) mod q. 

Proof. First note the identity 

(2 (p “i)/ 2 (l +0)4 = 2 2(p ”i)(l +/) 4 =2’ 2 (-2 2 )= -1 mod q 

Thus, by Theorem 1, it follows that 2 (p- 1 )I 2 (1 + /) is an 8th 
root of unity in GF(q 2 ). Also, by Corollary 1 , (2 (p- 1 )/ 2 (1 + /)) 7 
for / = 3, 5, 7, are 8th roots of unity in GF(q 2 ). That is, 


a 7 = 1 mod q 

Then, / must be a multiple of d { . But (j, d t ) = 1 . Hence (a°y ^ 1 
mod q , i.e., (oJ) d ! di =£ 1 mod q. By Theorem 1, a> is an ele- 
ment of order of a for (/, d) = 1. 

From Corollary 1, one observes that the number of ele- 
ments of order d in GF(q n ) is <p(d) y where <t>(d) denotes 
Euler’s Function. 

Corollary 2. Let GF(q 2 ) be a Golois field, where q = 2 p - 1 
is a Mersenne prime, an 8 • pth root of unity in GF(q 2 ) is 
(1 + *). 

Proof. First note the identity 

(1 + if p/p = (l + if = 2 4 4 1 mod 
and 

(1 + /)8P/2 = (j + ifP = (Jl 2 f = -2 2p 
= -1 =£ 1 mod (7 


( 2 ( p -» V 2 (i + /)) 3 = 2 < p - 1 * 2 (1 + /) 2 P_1 (1 + 0 2 
= 2^ P “ 1 ^ 2 (-1 +/)mod^ 

(2^-» 2 (1 + Of = 2< p -'* 2 (1 + 0 ( 2 {p ~ iy2 (1 + Of 

= _2^P -1 )/ 2 (1 + /) mod q 
and 

| 2 (P-1V2 (j + = 2 (p_1V2 (l + 0 (2 (p_iy2 (1 + o) 6 

= _ 2 ( p - iV 2 (_! + 0 

Therefore, an 8th root of unity in GF(q 2 ) is in one of forms 
±2(p-W (1 ±/)mod<7. 

By Corollary 2, for p = 31, (1 + i) is a 31 • 8th of unity in 
GF(q 2 ). If we let d = d x • d 2 = 31 *8, then a 31 • 8-point 
transform over GF(q 2 ) is 

3 1 » 8— 1 

^ ■ Z) <7) 

n = 0 


Thus, by Theorem 1, it follows that (1 + i) is an 8 • p th 
root of unity in GF(q 2 ). 


Corollary 3. Let GF(q 2 ) be a Golois field, where q = 2p - 1 
is a Mersenne prime, 2 7 is a p th root of unity in GF(q 2 ) for 
7=1,2,... ,p-l. 

Proof. First note the identity 


2 Pf p =2^1 mod q 


Thus, by Theorem 1 , it follows that 2 is a pth root of unity in 
GF{q 2 ). Also by Corollary 1, 2 7 is also apth root of unity in 
GF(q 2 ) for (j , p) - l,i.e.,/= 1,2, . . . ,p- 1. 


where a = (1 + i) is a 8 • pth root of unity in GF(q 2 ). 

From Eq. (3), the transform given in Eq. (7) consists of the 
following two stages: 


and 


8— i 


4 V * a > = ? a (» r » 2 ) a " 2 " 2modi7 (8a) 


"2 = 0 


31-1 


A k = A ik v k 2 ) = £ m ° d « ( §b ) 


"l = ° 
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where a 2 = (1 + z ) 217 = 2 15 (1 - z ) and = (1 + z ') 32 = 2 12 . 

If one uses an algorithm similar to that used in Example 3, a 
pipeline structure can be used to implement Eqs. (8a) and 
(8b). For p = 31, the VLSI processor for computing an 
8 • p-point transform over GF(q 2 ) is only composed of 
8 + 31= 39 basic cells. Each basic cell in this transform 
performs only a summation and cyclic shifts. 

V. Concluding Remarks 

It has been shown that a conventional prime factor dis- 
crete Fourier transform (DFT) algorithm can be used to 
realize a discrete Fourier-like transform on the field GF(q n ). 


This algorithm can also be used to compute cyclic convolu- 
tions of complex numbers and to decode Reed-Solomon codes. 
A primary advantage of the prime factor DFT is that the 
number of VLSI processors needed is substantially fewer than 
that of the more conventional systolic array implementation. 
The difficulty of the rearrangement of two input and output 
sequences of samples in the VLSI implementation was shown 
to be a soluble problem. As a consequence, a pipeline fast 
prime factor DFT over GF(q n ) can be developed. 

It is possible to use the pipeline fast prime factor DFT 
over GF( 2 7 8 ) in decoding a Reed-Solomon code. Further- 
more, a pipeline fast prime factor DFT over GF(q 2 ), where 
q = 2 p - 1 is a Mersenne prime, can be used to compute the 
cyclic convolutions of two sequences of 8 * p symbols. 
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CELL 1 CELL 2 CELL 3 CELL 4 CELL 5 CELL 6 

Fig. 1. The systolic array to compute a 6-point DFT using the direct method 



A 

Fig. 2. A pipeline architecture for the implementation of a fast prime factor 6-point DFT over GF(q n ) 
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Fig. 3. A pipeline architecture for the implementation of a fast prime factor 30-point DFT over GF(q n ) 











Appendix 

Algorithm for Rearranging the Input and Output Sequences 
in the Prime Factor DFT Over GF(q") 


To perform Eq. (3), one needs to convert input data from 
a n to a(n x ,n 2 ) an d inversely to convert output data from 
^i k \> k 2 ) t0 where 0 < n, k < d = d x • d 2 - l.ty mod 
dj and k t - k mod d^fori= 1 , 2. To see this, let the one-dimen- 
sional input data of DFT be a n , where 0 Kn<*d- d x ' d 2 - 1 . 
This one-dimensional input data can be shifted sequentially 
into a d x X d 2 two-dimensional matrix as follows: 


V 



fl o 


• • • 5 **2^+2 

V 

X 

a 3d x - 1 

' • ‘ ' a 3d x + 2 

a 2d x + 1 

V 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

a d 2 d x -1 

• ‘ * ’ a (d 2 -l)d 1 +2 

fl (d 2 -l)d 1+ l 

v 2 


(A-l) 

In (A-l) the input index n is mapped into the pair of indices 
(rtj , Atj), where n x = n mod d x and = n mod d 2 . The vector 
of index n 2 of the last and next to last columns of the 
matrix in (A-l) are, respectively. 


[0 *d x mod d 2 , 1 • d y mod d 2 , 2 * d y mod d 2 , . . . , (d 2 - 1) 
• d x mod d 2 ] T 

(A -2) 


and 


[(0 • d x + 1) mod d 2 ,(1 • d { + 1) mod ^,(2*^ + 1) 
mod d 2 ,. . . y ((d 2 - 1) ' d x + l)mod d 2 ] T 


(A-3) 


Then, one can find 1 in (A-3) and some value i in (A-2) 
such that 


1 = i 


d x mod d 2 


(A4) 


Given d x and d 2 , the solution of congruence (A4) is i. The 
index is periodic with period d 2 . If the indices in (A-3) 
are cyclically shifted up by i or down by d 2 - / , the index 
of the resulting sequence is equal to the index n 2 of the 
sequence in (A-2). In a similar fashion, if the data of the kih 
column of (A-l) is cyclically shifted up or down by i or by 
d 2 - i , then the index n 2 of the resulting data of the Arth 
column is equal to the index n 2 of the data of the (k + l)th 
column of the matrix in (A-l). To illustrate this, two examples 
are presented. 

Example A. 1. Let the length of input sequence be d = 
d\ * d 2 = 3 • 2. Convert a n into ci(n iy n 2 ) an d inversely con- 
vert v4( rtl> „ 2 ) into A n , where 0 < n < 5, n x = n mod 3 and 
= n mod 2. First, the input sequence a n , 0 < n < 5, is 
shifted sequentially into a 3 X 2 matrix as follows: 


fl 2 

*1 

a o 

a 5 

% 

h 


(A-5) 


Then, the index of is mapped into two pairs of indices 
(n x , a^). Thus, the matrix in (A-5) becomes 


V.O) 

Vi) 

a (0,0) 

V.O 

V<>) 

Vo 


(A-6) 


where T denotes matrix transpose. 


The index of the last and next to last columns of (A-5) are 
[0 • 3 mod 2,1*3 mod 2] T (A -7) 

and 

[(0*3+ 1) mod 2, (1 • 3 + 1) mod 2] T (A-8) 
respectively. 
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From (A-4), one obtains the congruence as 

1 = / * 3 mod 2 (A-9) 

The solution of (A-9) is i = 1 . Let the data of the first and 
second columns of the 3 X 2 matrix in (A-6) be cyclically 
shifted up by two and by one, respectively. Then, the 2-D 
matrix is arranged in the same order as the index of the 
last column: 


“(2,0) 

“(1,0) 

“(0,0) 

“(2,1) 

"(1.1) 

s. 

o' 

ST 


This matrix is used to input data for a prime factor DFT 
algorithm over GF(q 2 ) after taking three 2-point transforms 
of the columns and then taking two 3-point transforms of the 
rows in (A-10); then, (A-10) becomes 


j4 (2,0) 

^(1,0) 

^(0,0) 

^(2,1) 

^(M) 

^(0,1) 


To inversely convert 2-D matrix in (A-ll) into 1-D A n for 
0 < n < 5, first, the data of the first and second columns of 
the 3X2 matrix in (A-l 1) are cyclically shifted down by two 
and by one, respectively. Then, the matrix in (A-l 1) becomes 


A 

(2,0) 

^(1,1) 

^(0,0) 

A 

(2,1) 

^(l.O) 

^(0,1) 


^2 

A l 

A o 

^5 

A 4 

A 3 


Finally, is shifted out sequentially from this matrix. 

Example A.2. Let the length of input sequence be d = 
d x • d 2 - 5 • 6. Convert a n into and inversely con- 

vert from output sequence ^(^,* 2 ) into A k , where 0 < w, 
k < 30, Ti- = n mod 5, and k i = k mod 6 for i - 1 , 2. 


First, the input sequence a n , 0 < n < 30 is shifted sequen- 
tially into a 5 X 6 matrix as follows: 


°4 

a 3 

*2 

*1 

*0 

°9 

*8 

a 7 

*6 

a 5 

a i4 

fl 13 

“ 12 

*11 

*10 

a i9 

“ l 8 

*17 

*16 

a i5 

a 24 

a 23 

*22 

a 2\ 

d 20 

a 29 

fl 28 

a 21 

*26 

*25 


(A-l 2) 

If the index of a n is mapped into the pair of indices (n x , n 2 ), 
where n x = n mod 5 and n 2 = n mod 6, then the matrix in 
(A-12) becomes 


*( 4 , 4 ) 

“( 3 , 3 ) 

*( 2 , 2 ) 


Vo) 

*( 4 , 3 ) 

*( 3 , 2 ) 

*( 2 , 1 ) 

Vo) 

*( 0 , 5 ) 

*( 4 , 2 ) 

a ( 3 , l ) 

*( 2 , 0 ) 

* 0 , 5 ) 

*( 0 , 4 ) 

“(4,1) 

a ( 3 , 0 ) 

*( 2 , 5 ) 

*( M ) 

%, 3 ) 

*( 4 , 0 ) 

"( 3 , 5 ) 

*( 2 , 4 ) 

* 0 , 3 ) 

“( 0 , 2 ) 

*( 4 , 5 ) 

*( 3 , 4 ) 

fl ( 2 , 3 ) 

fl ( 1 , 2 ) 

“( 0 , 1 ) 


(A- 13) 

In order to arrange the second index of a pair (Wj,^), i.e., 
n 2 in order, from (A-12), one solves the congruence 


1=1*5 mod 6 (A-l 4) 


The solution of (A-l 4) is / = 5 or - / = 6 - 5 = 1 . If the data 
of the 1st, 2nd, 3rd, and 4th columns of (A-3) are cyclically 
shifted down by 4, 3, 2, and l,the columns of the 5 X 6 matrix 
in (A-l 3) are rearranged in the same order as the last column 
of (A-13) as follows: 
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“(4,0) 

“(3,0) 

“(2,0) 

“(1,0) 

o 

© 

“(4,5) 

“(3,5) 

“(2,5) 

“(1,5) 

“(0,5) 

“(4,4) 

“(3,4) 

“(2,4) 

“(1,4) 

“(0,4) 

“(4,3) 

“(3,3) 

“(2,3) 

“(1,3) 

“(0,3) 

“(4,2) 

“(3,2) 

“(2,2) 

“(1,2) 

“(0,2) 

“(4,1) 

“(3,1) 

“(2,1) 

“(1,1) 

“(0,1) 


(A-15) 

Finally, if one arranges the rows so that the index is in 
numerical sequence, (A-15) becomes the desired result, namely, 


“(4,0) 

“(3,0) 

*(2,0) 

“(1,0) 

*(0,0) 

“(4,1) 

“(3,1) 

*(2,1) 

“(1,1) 

“(0,1) 

“(4,2) 

“(3,2) 

*(2,2) 

“(1,2) 

“(0,2) 

“(4,3) 

“(3,3) 

“(2,3) 

“(1,3) 

*(0,3) 

*(4,4) 

*(3,4) 

“(2,4) 

*(1,4) 

*(0,4) 

“(4,5) 

*(3,5) 

“(2,5) 

“(1,5) 

*(0,5) 


(A-16) 

After taking five 6-point transforms of the columns and then 
taking six 5-point transforms of the rows in (A-16), (A-16) 
becomes 


A 

^(4,0) 

^(3,0) 

^(2,0) 

^(1.0) 

^(0,0) 

^(4,1) 

^(3,1) 

^(2,1) 

^(1,1) 

^(0,1) 

^(4,2) 

^(3,2) 

^(2,2) 

^(1,2) 

A 

(0,2) 

^(4,3) 

A 

(3,3) 

^(2,3) 

^(1,3) 

^(0,3) 

^(4,4) 

A 

(3,4) 

^(2.4) 

A OA) 

— 71 
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^(4,5) 

A 

(3,5) 

^(2,5) 

A ( 1 ,5) 

^(0,5) 


(A-17) 


To rearrange from A( k lt k 2 ) in (A-17) into A k , first one 
arranges the rows in (A-17) such that the index n 2 of 5 X 6 
matrix in (A-17) are in the same order as the index in (A-15) 
as follows: 


^(4,0) 

^(3,0) 

A 

^(2,0) 

^(1,0) 

^(0,0) 

^(4,5) 

^(3,5) 

^(2,5) 

^(1,5) 

^(0,5) 

^(4,4) 

^(3,4) 

^(2,4) 

A ( 1 .4) 

A 

^(0,4) 

j4 (4,3) 

^(3,3) 

A 

^(2,3) 

A ( 1 ,3) 

^(0,3) 

^(4,2) 

^(3,2) 

^(2,2) 

A ( 1 ,2) 

^(0,2) 

^(4,1) 

^(3,1) 

^(2,1) 

^(M) 

^(0,1) 


(A- 18) 

Then, the 1st, 2nd, 3rd, and 4th columns of the matrix are 
cyclically shifted up by four, three, two, and one data, respec- 
tively. Thus, one obtains 


^(4,4) 

^(3,3) 

^(2,2) 

^(1,1) 

^(0,0) 

^(4,3) 

^(3,2) 

^(2,1) 

^(1,0) 

^(0,5) 

^(4,2) 

^(3,1) 

^(2,0) 

A ( 1 ,5) 

^(0,4) 

^(4,1) 

A 

^(3,0) 

^(2,5) 

A 

^(1,4) 

^(0,3) 

A 

^(4,0) 

^(3,5) 

^(2,4) 

A 

(1,3) 

^(0,2) 

^(4,5) 

^(3,4) 

A 

(2,3) 

A ( 1 ,2) 

^(0,1) 


(A- 19) 

Finally, A k is shifted sequentially out from this matrix. 
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